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1.0 INTRODUCTION 
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2.0 TECHNOLOGY 

The SPDI Box is built with Macro Cell Arrays. It should 
require no RAMs, and uses little more than the standard bus 
interface chipset. It should fit on a fraction of a board. 



3.0 FUNCTIONAL DESCRIPTION 

The SPDI Box sits on the Dolphin Bus and recognizes bus 
requests for memory having address bit 9 a one. It will also 
recognize requests in I/O address space for its internal registers 
and status. Its most 
requests to memory. The 



pages, and tag them with 
pages actually live in 
physical page with bit 9 



commonly used function is with "write" 
processors will detect "shared writable" 
a one in bit 9 of their addresses. These 

physical memory in the corresponding 
a zero. 



A "write" request to an address with bit 9 a one is 
acknowleged by the SPDI box if it is not busy. The SPDI box will 
then re-issue a write request, deleting bit 9, and sending along 
the data word(s) received. (At this time no use is known for 
other than single word requests.) It then issues "cache zapper" 
bus requests to all other active processors in the system. These 
are bus cycles which request the processor to kill any copies of 
the word(s) it may have in its cache. The SPDI box will retry 

until accepted, or a timeout occurs. The_£osrtive -n, 
_by a processor of a _.^SPSJL...J5tBB~~S^^E^Z^^Ii- . 



these "zappers 
acknowledgment 
sufficient 



zap Its ca^cHibefore issuing any further bus memory requests. The 

nowIegiH. 



gpDXnHXr~retnraTil^™~KIsy"~un"t fj™5Xr~^pgYs" 



An internal control register in I/O space identifies the 
active processors, and the requesting processor identifies itself 
in the ID field of each request. 

The only "read" request handled is an "interlock read". 
Processors will suppress bit 9 from the address on other reads. 
Upon receipt of an "interlock read", the SPDI box re-issues it 
with bit 9 suppressed, in the name of the requesting processor. 
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This arranges that the data will return directly to the requesting 

processor. The SPDI box will acknowlege the requesting processor 

immediately, unless it is busy, and will keep trying the memory 

request until accepted or a timeout occurs. The SPDI box will 

^ / then enter a "selective busy" state, in which it appears busy to 

f^^^ L all memory requests except those from the processor which issued 

^D^^o* jfT the "interlock read". The SPDI Box will remain in this state 

./^ p.* i until the receipt of a "write release" request, which will be 



^v 



\^ (^handled in the same fashion as described for "write" requests. 



4.0 APPLICABILITY 

Only one SPDI box may be active in the system at one time. 
This is necessary to insure that data written under one "critical 
sequence" interlock will be seen by a subsequent processor seizing 
that interlock. For reliability reasons, it seems desirable to 
have a "hot standby" unit. For this reason, it may be desirable 
to combine the functionality defined herein into the Dolphin Bus 
Repeater , since the most common multiprocessor configuration (2 
processors) has exactly 2 bus repeaters. 



5.0 COST 

It is currently estimated (by John Allen) that the SPDI box 
will take 17 or 18 MCAs, and 1/2 board. If it is combined with 
the bus repeater, this drops to 6 or 7 MCAs. 



6.0 OPEN ISSUES 

Should the SPDI box contain some sort of interprocessor 
communication facility? 
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TO: Dolphin Project List 

Jim Fleming 
Peter Hurley 
Ron McLean 

****REVISION 2**** 

SUBJ: Shared Pages in Multiprocessor Dolphin Systems 

This revision (2) corrects some problems discovered in 
revision 1 in the area of avoinding a deadly embrace, and assuring 
data modified under an interlock will be propagated before another 
processor can seize the same interlock. 

This memo is to clarify and refine the scheme for handling 
shared writable pages in multiprocessor Dolphin systems which was 
the subject of the John Allen memo of 16 Jun 78 (file: MB0X4). 
It also represents a revision and improvement on the scheme 
described in my memo of 18 Jul 78 (file: SHARED). This new 
scheme includes suggestions made by Mike Newman and a 
brainstorming session in my office with Bosack, Guglielmi, Allen, 
Lewine, McLean and Murphy. 

The problem to be solved is that of shared writable pages. 
These are defined as pages which have at least one processor which 
has written the page, and at least one other processor making any 
references to the page. Tnis state is to be carefully 
distinguished from the state where many processors have the 
privilege of writing, but none has yet done so, or the case where 
only one processor has yet exercised the option of referencing the 
page, even though several have the privilege. 

It is a goal of the system to insure that words written by 
one processor will be seen by the other processors after a "short" 
delay, even though for speed reasons, these pages wish to be 
cached by all processors. Unless an interlock instruction (such 
as AOSE) is used, no guarantee can be made that simultaneous 
updating of a word will work, or that one processor will 
"instantaneously" see the results of another's writing. It is, 
however, a goal that shared data updated by one processor which is 
in control of an interlock will be seen only in updated form by a 
second processor when that processor gains possession of the 
interlock. 

To arrange that data integrity is maintained by the hardware 
(and microcode), it is first necessary to detect that a page has 
become shared writable. This is done with the help of several 
bits in the Core Status Table (CST) for the page. The bits in the 
CST are one for each processor in the system, currently a maximum 
of 4; along with the existing "modified" bit. Each "processor" 
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I 

A^bit'o? ?hl phys?cal^addrP?S' hasTnade any reference to the page, 
an optional "shLerPagesDj'rinKSr^'" iJ"^^ in conjunction w??h 
Dolphin BUS. .he ru-?io^r^.-^!f-^ ^'^l^^^.^.,^ 

^CBti^ or---/Ls?f -c^j -?^-s Lc^i^?i-r:fth-?i 

I is an ' nte lo^k^d read ir??: l' "''" '" ^'^ "''• "^^^^ referencl 

already shar'^d-Srl^ablTiron: Tn^'wh^ch -.ore^'rhin^'o"''^ "'"^' '' 
refprAni-o Ki-4. J- ^ wnicn more than one processor 

clir.T.l ^icrocole ^S^la ^"' '''^ "-^ifiea" bit is set." I^fhls 
'• tllii til^l iTrV^n T.lVy"' ''''^^' ='""<' " '»•- P«9e 

'• woji? La":.?j: irLc" ^'°""°' "^^ "' °' '-^^ "^ 

4-w Z. 1 "oairied bat set, ana exactly one use bit set bu^ nni- 
the bit for the processor doing the update. This is thP r.ll J 

?hlred''1n'?h'^^^ "^^^^^" ^^ anoth?r'p;oces:;r ' and "o^ %et 
snarea. m this case, the following actions are taken: ^ 

^* wSri?^ '^^ ^^"^ update, no modification is made to the CST 
2. A page fault is taken to the software. 

'• sel Tthe'cs?'"aa^"%' CST update interlock and looks to 
"hf.no J T ^?.^ ^'^^^ ^^ s^t to a combination called 
hf^tJ H'^" state", (which is detectable by microcode to 
block references to the page). ^^^^^-ae to 

;,rfinn^ f ^ ' ^''"'^°''« ^^^^ 9°^ here first, and no further 
?S s?ep 5, "^"^^^^y- ^^^^^^e the CST interlock and skip 

If not, set the transition state, write back thp r^T 
word, and release the interlock. ^^^ 

4. A req-uest is issued to the processor whose reference bit 

do%h2 IW'^.''^'': ^^^^ '^^'^uest causes that processor Jo 
do the following items: 

Issue a cache sweep to validate core on the subject 



1. 



v>.^->1 (' 2 . 









7r ■. 



physical page. 

Issue a new instruction which scans the page table 
cache, setting the CST update bit on all entries 
pointing to the object physical page. 
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I 

3. Upon completion of both these items, doing a CST 
update, changing the "age" field to the current time, 
and oring-in the processor use bit of the processor 
which initiated this request. 

I 5. Go off and do something else until a check of th^ CST 

word for the object page returns to legal state. 

roc.,™*'^^;?^*^^ processor detecting the above transition is able t-o 

n^opSn^ i?''S"-^ ""^'"^ ^^^ ^^"^-' co'^'^ect data for the object 
page will all be in main memory, and a mechanism described below 
will insure continued integrity. Since no successful reference 
was made to the object page before the above sequence wal 
executed, the first retry of the failed reference will cause a CST 
upaate reference to be made, where it will be determined that fhe 
page is now shared writable. 

wnnin^ l^ll^ processor tripping over the same problem as above 
woula either have its microcode CST update find the page in 
tronsition state", and just wait for it to become unblocked, or 
might 3ust catch the page after the second processor detecfed the 
transition and before it set the "transition state". In that 
case, the third processor will hit the CST update interlock 
mentioned above, and will either take over the transition action, 
or yield to the second processor, and wait. 

J-f a processor attempts a CST update and finds that it is the 
tirst to attempt to write a page that at least one other processor 
\$' ^^ reading, the transition to shared writable is much simpler. 
Tne microcode simply ors-in the processor reference bit and the 
moaified bit, and writes back the CST word. It then sei-s bit 9 in 
j^' Its page table cache entry, just as if it had discovered that t-he 

page was shared writable before it came along. 

<^/\ir This works because any copies of words from this page in any 

.A^^'kf J\ "^ k'""^^ ^^ correct, since no one had modified them before, and 

^.J-xrlTl S"':^sequent modifications will be handled by the "shared pages 

■> it^i itt integrity" box. Processors not having write privileges on 

^ '^y" I u ^ P^^^' ^^ "°^ y^^ having written the page do not set bii- 9 of 

J'^J I ^fjfi'^^address, and do not set "written" in their page table cache. 

This distinction allows pages to have writable access from some 

processes and not from others. 

The effect of setting bit 9 of a physical address is to put 
the page in "wr itethrough" cache status. This means that each 
time a word in such a page is written, it will be both written 
into the cache, and sent to memory. Although during ordinary read 
operations, bit 9 is suppressed from the address sent <-o the 
memories, it is included during write operations and the read 
portions of interlock reads. This is where the "shared paqes da*-a 
integrity" (SPDI) box comes in. 
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^.nrriJ^'"^^''V^ memory is limited to' 2 to the 26th words (67 Million 
^^ ?S^*c.r^? Reference above this limit, (address bit 9 on), refers 
i? no.^T ^°^-..f" '^^ ^^^^ °f ^ "«^ite" operation, the SPDI box 
less Mt 9 ^^''inl^^K^P'•^' ^""^ fe-issue it to the same address' 
othtr LI- ^^ ^''^" '^^"^ ^ ^^^'^^ zapper" request to each 

other active processor in the system for the word (or words) in 
?ransi^?,'o "'^^ J<=c^Ptance by the processor of the cache z^opfr^is 
^hf2! S" /"^i^r^tes that the receiving processor will delete 
these words from ats cache. Thus, any write in a shared pag- will 
soon assure that no old copy resides in any cache. 

no^ hncf,^^ ""^f? °^ ^" interlock read operation, the SPDI box, if 
Jess M?^9 ""^i •f^'^KP'- '''.^"'^ re-issue it to the same add;ess, 
less bit 9. It will be issued in the name of the originaMnq 
|pm^;^''''-.r ^'^^ 'result will go directly to that processor The 

(1^ ^^'^ niJ^ "f"*^'" u"^^ *° ^^^ ^""^ ""'^^ release operations. 

It IS not clear whether this is a "interlock busy" or ordinary 

busy" response.) These will be treated the same as write 

operations explained above. wnre 

Explicit interlock instructions (like AOSE) are handled as 
roiiows. If an interlock instruction is encountered by the EBOX, 
It will first issue an interlock-read to the MBOX. This operation 
implies a write test by the MBOX, since writing privileges are 
necessary to do this type of instruction. The MBOX will only 
execute an honest memory interlock function if the page is marked 
writethrough" in the page table cache by virtue of bii- 9 being 
one in tne physical address, or if the page is marked uncached. 
Otherwise, if the page is marked "written" already, the read will 
be done normally from the cache (or memory if necessary). If i-he 
page is either marked "writable but not written", or is nnt in the 
page table cache, the normal paging algorithm will be clanked, 
which will cause, in the normal course of events, an interlocked 
update of the CST. Thus, the first processor attempting to do an 
interlocK on a given word either finds the page already shared, in 
which case the memory interlock is done, or finds that the page is 
not shared, or will execute the algorithms above if it determines 
that this is the transition to shared writable status. 

If the page was not shared, then the interlock funci-ion is 
irrelevant. The next processor to attempt to use this interlock 
will trip over the transition to sharing. A processor which 
detects the transition to sharing puts the page into shared status 
Joetore it can read the word, and will find a good copy in memory. 

Since the interlock read goes through the SPDI box, <-he 

following deadly embrace is avoided: Processor 1 has <-he 

interlock and is attempting to release it with a SETOM. Processor 

2 IS testing the interlock. If processor 2 could do the interlock 

read around the SPDI box, it could lock-up the memory interlock 

when the SPDI box was attempting to do the SETOM write. The SPDI 

box would wait patiently for the interlock to free-up, which it 
would never do, since the interlock write is camping at ^he qate 
for the SPDI box, which is busy. 
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In order to assure that data .inodified under the control of an 
interlock will be seen in correct form by a subsequent owner of 
the interlock, only one SPDI box can be active in a system. 
Otherwise, the last data write may still be rattling around the 
buses after the interlock is released and reseized. 

Bus repeaters must be prepared to save write requests which 
have been rejected for interlock reasons, and let other requests 
go by, retrying the interlocks occasionally. The space in the bus 
repeater buffer used by an interlock request must be saved until 
the write release comes by, or else a deadly embrace similar to 
the one described above regarding the SPDI box may occur. Thus 
the bus repeater may be able to accept only write release 
operations at certain times. 

I currently believe that this scheme hangs together. The 
loose ends are: 

1. How the messages to flush page table caches and sweep\ 
data caches are handled J 

2. How KI paging makes use of all this, since a CST isj 
needed . J 

3. Vvhat kind of busy responses to send from the SPDI box^' 
I solicit your comments. 
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TO: Dolphin Project List 

Jim Fleming 
Peter Hurley 
Ron McLean 

**** REVISION 1^**** 

SUBJ: Shared Pages in Multiprocessor Dolphin Systems 

This memo is to clarify and refine the scheme for handling 
shared writable pages in multiprocessor Dolphin systems which was 
the subject of the John Allen memo of 16 Jun 78 (file: MB0X4). 
It also represents a revision and improvement on the scheme 
described in my memo of 18 Jul 78 (file: SHARED). This new 
scheme includes suggestions made by Mike Newman and a 
brainstorming session in my office with Bosack, Guglielmi, Allen, 
Lewine , McLean and Murphy. 

The problem to be solved is that of shared writable pages. 

are defined as pages which have at least one processor which 

written the Daae . and at Ipast- nn*» ni-hor -nmnaacnr maHr^/x -»»,tt 



These citt; ueiinea as pages wnicn nave at least one processor which 
has written the page, and at least one other processor making any 
references to the page. This state is to be carefully 
distinguished from the state where many 



is to be 
processors 



aistinguisned from the state where many processors have the 
privilege of writing, but none has yet done so, or the case where 
only one processor has yet exercised the option of referencing the 
page, even though several have the privilege. 

It is a goal of the system to insure that words written by 
one processor will be seen by the other processors after a "short" 
delay, even though for speed reasons, these pages wish to be 
cached by all processors. Unless an interlock instruction (such 
as AOSE) is used, no guarantee can be made that simultaneous 
updating of a word will work, or that one processor will 
"instantaneously" see the results of another's writing. 

To arrange that data integrity is maintained by the hardware 
(and microcode) , it is first necessary to detect that a page has 
become shared writable. This is done with the help of several 
bits in the Core Status Table (CST) for the page. The bits in the 
CST are one for each processor in the system, currently a maximum 
of 4; along with the existing "modified" bit. Each "processor" 
bit is a one if that processor has made any reference to the page. 
A bit of the physical addressing space is used in conjunction with 
an optional "Shared Pages Data Integrity" box, which goes on the 
Dolphin Bus. The function of this bit (bit 9) will be explained 
below. 

Each time a change or update is made to a processor's page 
table cache, or a page table cache entry is encountered with its 
CST update bit on, a reference is made to the CST. This reference 
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is an interlocked read-write sequence. A physical page which is 
already shared-writable is one in\hich more than onf p^ocesso? 
reference bit is set, and the "modified" bit is set. In this 
case, the microcode would 

-1. Or-in bit 9 into the physical address stored in the paae 
table cache for this entry. 

2. Or-in a one bit into its processor use bit of the CST 
word, and write it back. 

^>...J?^^^ ■f^t-,^'^^ ^^^^^ ^" ^''^^^^ ^ P^^e needs to be changed to 
h^2 !k ^^i^5^^„^^?^"^• "^^^ fi^st is that the CST word is read 
had the 'modified" bit set, and exactly one use bit set, but not 
the bit for the processor doing the update. This is the case of a 
page which has been written by another processor, and not yet 
shared. In this case, the following actions are taken: 

1. During the CST update, no modification is made to the CST 
word. 

2. A page fault is taken to the software. 

3. The monitor goes thru a CST update interlock and looks to 
see If the CST "age" field is set to a combination called 

transition state", (which is detectable by microcode to 
block references to the page) . 

If so, someone else got here first, and no further 
action is necessary. Release the CST interlock and skip 
to step 5. *^ 

If not, set the transition state, write back the CST 
word, and release the interlock. 

4. A request is issued to the processor whose reference bit 
was on m the CST. This request causes that processor to 
do the following items: 

1. Issue a cache sweep to validate core on the subject 
physical page, 

2. Issue a new instruction which scans the page table 
cache, setting the CST update bit on all entries 
pointing to the object physical page. 

3. Upon completion of both these items, doing a CST 
update, changing the "age" field to the current time, 
and oring-in the processor use bit of the processor 
which initiated this request. 
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5. Go off and do something else until a check of -the CST 
word for the object page returns to legal state. 

When the processor detecting the above transition is able to 
resume the process causing the fault, correct data for the object 
page will all be in main memory, and a mechanism described below 
will insure continued integrity. Since no successful reference 
was made to the object page before the above sequence was 
executed, the first retry of the failed reference will cause a CST 
update reference to be made, where it will be determined that the 
page is now shared writable. 

A third processor tripping over the same problem as above 
would either have its microcode CST update find the page in 
"transition state", and just wait for it to become unblocked, or 
might just catch the page after the second processor detected the 
transition and before it set the "transition state". In that 
case, the third processor will hit the CST update interlock 
mentioned above, and will either take over the transition action, 
or yield to the second processor, and wait. 

If a processor attempts a CST update and finds that it is the 
first to attempt to write a page that at least one other processor 
is reading, the transition to shared writable is much simpler. 
The microcode simply ors-in the processor reference bit and the 
modified bit, and writes back the CST word. It then sets bit 9 in 
its page table cache entry, just as if it had discovered that the 
page was shared writable before it came along. 

This works because any copies of words from this page in any 
cache must be correct, since no one had modified them before, and 
any subsequent modifications will be handled by the "shared pages 
data integrity" box. Processors not having write privileges on 
this page, or not yet having written the page do not set bit 9 of 
their address, and do not set "written" in their page table cache. 
This distinction allows pages to have writable access from some 
processes and not from others. 

The effect of setting bit 9 of a physical address is to put 
the page in "writethrough" cache status. This means that each 
time a word in such a page is written, it will be both written 
into the cache, and sent to memory. Although during read 
operations, bit 9 is suppressed from the address sent to the 
memories, it is included during write operations. This is where 
the "shared pages data integrity" (SPDI) box comes in. 

Ordinary memory is limited to 2 to the 26th words (67 Million 
words). A write reference above this limit, (address bit 9 on), 
refers to the SPDI box. It will accept the write operation, if 
not busy, and re-issue it to the same address, less bit 9. It 
will then issue a "cache zapper" request to each other active 
processor in the system for the word (or words) in question. The 
acknowlegement by the processor of the cache zapper bus 
transaction indicates that the receiving processor will delete 
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these words from its cache. Thus, any write in a shared page will 
soon assure that no old copy resides in any cache. 

^r^n^^'^^^^^x^ interlock instructions (like AOSE) are handled as 
loiiows. If an interlock instruction is encountered by the EBOX, 
It will first issue an interlock-read to the MBOX. This operatioA 

necesIarv\o*'di''^h,-^^f ^^^^^ ''^°''' ^^^"^ ^"^^^^^9 privileges are 
«!S!^? ^ \.^° ^^^^ ^^^ °^ instruction. The MBOX will only 
execute an honest memory interlock function if the page is marked 
"writethrough" in the page table cache by virtue of bit I beinq 

0?heriise ff ?Srif^^ address, or if the page is marked uncached! 
Otherwise, If the page is marked "written" already, the read will 
be done normally from the cache (or memory if necessary) . If the 
S^r ^? ti^^^'' "^t^^^^ "writable but not written", or is not in the 
ShfJh lf^^^ cache, the normal paging algorithm will be clanked, 
which will cause, m the normal course of events, an interlocked 
update Of the CST. Thus, the first processor attempting to do an 
ih^^H^'' °!?v.^ ^^''^'' ''°''^ either finds the page already shared, in 
which case the memory interlock is done, or finds that the page is 
^h!/^S- • "U^^^ execute the algorithms above if it determines 
that this is the transition to shared writable status. 

If the page was not shared, then the interlock function is 
irrelevant. The next processor to attempt to use this interlock 
^tl J^'^L'"'!'' ^^^ .transition to sharing. A processor which 
detects the transition to sharing puts the page into shared status 
before it can read the word, and will find a good copy in memory. 

I currently believe that this scheme hangs together. The 
loose ends are: -^ y^^wci.. xue 

1. How the messages to flush page table caches and sweep 
data caches are handled) 

2. The location and number of SPDI boxes 

3. How KI paging makes use of all this, since a CST is 
needed . 

I solicit your comments. 
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references To ^ Ih^ pace Thi^^f^" P-:ocessor making any 
aistinguished from the bI^;» 3hi "'*^ " *° *>* carefully 
privilege of wrl?i„a buf S^L k *" .'°!"y processors have the 
only onl processor KIs yet ex«ci«/?h»''^"^ •^°' °/ *''^ ="« ""ere 
page, eveS though s«L^!*h^Je"?irpri!;!leS:°" °' "'"-^l-^ the 

one pr"cesslr'wil°be's«n^bv^throiE^"" *"" ""^^^ """*" "^ 
-instantaneously, see the'res^^ts 'of "another ■s°«it!ng!"='" ""' 



reference to ?he page ^ '* "" P'°«^=" "as made any 

2if Mr??L^;?ic2s"Srij%?ti^;?j„rto lit'-iSe ^?!t;:^'£ " 

and any other processor reference bit? arl on, then the pige Is 
transitioning to the shared writable state Tfi^Tt 5 2« Jt 
Physical address was a one, the ^Igels^al^^ly- shared "„?i?Ible! 
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of the bus physical addriss if fJlA ?"'='' '"^^ST word. Bit 9 
read refe'rences. """"ss is forced to zero by the 1IB0X on all 

shared-'SruLTe"^!!^: °it' ll^ttn tllVr *?-.*"«"!<-» to a 
monitor. If the cause of fh=f?-.? ^'^^ ^'"^'^ Interrupt to the 
attempting to rllerenci a Loe «hJ i'n ^\*'"" "" P'o^ssor is 
processor? the foLowtnraftfo^I IlSsf be'takSn^ "''""" ^^ """^ 

'• procelsor"'!s "^a iS'""' *''^ reference bit for this 
'■ ?n?rtL" ta ln?fy*^%S?!-=?J l^l --"S must be ored 

" i:Sa-r-K?F-"- 

foHo^pg-stjr^^^^^^^^^ 

?a??d:??i| %-? ' €-V e""- -«ron%h'e^"1ag^- 
InrsriylS £""- --Tp^'es-lIS -in-?"l? 

*• SI ^cSS;i%^?LJ^Sl'?iSS%:*:p\^i-i;: --^r "-t confirm 
need not stop its own%?:c%'lslng'S^r?nr?gir?!:r'"- "" 

'■ =achlf'2^d''"Sir°Lr'the'r^^?- ""^*' '^^ P»9e table 
asked for! ^ confirmation of the sweep, if 

- '■ ZilLl^Mil^^^^d anrihe"?lt?e:^iJ?'° --- ^ 

2. The Old copy Of the CST entry would be sived bv ^h^ 

?i?s°^S-J f-om'Hh: ^?L?o^- oi-" -"""^ " «sAuish^ 

^" ?he"lMf """ '"° "" » °' the Physical page address In 
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'* ?Jlf?ISc%"?i?r\e'n?iJ ^JL'r^?".?^\-^/--" - their 
caches. teiimg them to flush their page table 

5. This processor must flush its page table cache. 

» -6. No cache sweeps are necessary since all fl«fa -«« *.«^v 

and caches are correct. since ail data 4n both core 



8. Another processor detectina this ei-^i-^ 

simultaneously cannot in fLtdi«of-® aore-or-less 
of the "written" h??L i^ Z ^?' ^^"^® ^he oring-in 

time a word in such a page is «r it tin f J ^i i T S """"^ t^^t each 

Jpe'Lt-s.=?r^ ifibse? r e f-" --""- 
memories. It Is included during „?» address sent to the 

the "Shared pages IStil^teg'^IJ^S 'ilUh'^inoTel- ir,'^"'' '' """^ 

words?!''^"r^w?!?e°^?eL%e\^^i'r.ov%° ?hJ^ l1^i?"Ur1s%'^'t","^T 
nof "L^?, ^-Inl^'-Je^fssue^ir iJ ?."T '^!«i?' °P«atLl°"ii 
Will theK'lssue a -caJhezapper- *rLues? '?o"faih''o^"' '• •" 
processor in the system for the word Inr ^.r^J^ °^^". *=tive 
acknowlegement by the procesLr of fh!^^' i" ''"'^="°"- The 
transaction indicates that the rlL<^ "*'=''* ^^^P" bus 
these words from l?s cache tJm= i™*".?^ processor will delete 
soon assure that 'nlMrl^y ^sfdefL^ircaShe'.^'"^' ^''^ ""^ 

Explicit interlock instructions (like aose^ aro »,»«>»i ^ 
f?^i?rf*.- 'f ^'^ interlock instruction is encounte?el bS Jhe^BOx" 

SSflh ^??J%aSs%=''n„ J£f „Sr°i^^^ '"^^"^ ?'"*""" 2LPbr^l^SKe*S: 
update Of tte CST iS^» fhf^° "I" °* events, an Interlocked 
interlock on a giv;„ ^rl'elthef i iLf^n"^" attempting to do an 
«hich case the LntrrinlVll^ViiroL?l/ni.l'lllt\Tplti H 
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JS«4.^Jh?!*^^ ""iJ^^^ execute the algorithms above if it aetermines 
that this IS the transition to shared writable status. 

irrelevaS^ ^^Th^.^'^nov?^ ^^^^^^' t*^«" the interlock function is 
iill triS ;v.r Jho t^ P^jcessor to attempt to use this interlock 
Seiects the ?ransftinn^?r«i°^ ^° sharing. A processor ^hich 
Slfore it Ln ?2J<5 in" ^° f^^'^'"^ ?"^^ ^^^ P^^e into shared -status 
*>e£ore at can read the word, and will find a good copy in memory. 

loose^ Indr^beino^innh''*'^^ ^^'^ ^^^^"'^ *^^"9^ together, with the 
Aoose ends being how the messages to flush page table caches and 

boxS ^i%o??^?? r' *^""''"''' ^^^ ^^^ location^and nimber o? SpS? 
Doxes. I solicit your comments. 



